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Abstract. Using Stein's method techniques, we develop a framework which allows one to bound the error 
terms arising from approximation by the Laplace distribution and apply it to the study of random sums of 
mean zero random variables. As a corollary, we deduce a Berry-Esseen type theorem for the convergence of 
certain geometric sums. Our results make use of a second order characterizing equation and a distributional 
transformation which can be realized by iterating the X — P bias transformation introduced by Goldstein 
and Reinert. 

1. Background and Introduction 

Beginning with the publication of Charles Stein's seminal paper [1^] in 1972, probabilists and statis- 
ticians have developed a wide range of techniques based on characterizing equations for bounding the 
distance between the distribution of a random variable X and and that of a random variable Z having 
some specified target distribution. The metrics for which these techniques are applicable are of the form 
di-i{-^{X),^{Z)) — sup^jg^ \E[h{X)] — for some suitable class of functions 'H, and include as spe- 

cial cases the Wasserstein, Kolmogorov, and total variation distances. (The Kolmogorov distance gives the 
distance between the associated distribution functions, soH = {l(-oo,a](a^) '■ a G K}. The total variation 
and Wasserstein distances correspond to letting H consist of indicators of Borel sets and 1-Lipschitz func- 
tions, respectively.) The basic idea is to find an operator A such that E[{Af){X)] = for all / belonging 
to some sufhciently large class of functions T if and only if ^{X) = ^{Z). For example, Stein showed 
that E[{ANf){Z)] = E[Zf{Z) — <7^f'{Z)] — for all absolutely continuous functions / such that these 
expectations exist if and only ii Z ^ A^(0, cr^) [16], and his student Louis Chen showed shortly thereafter 
that Z - Poisson(A) if and only if E[{Apf )iZ)] ^ E[Zf{Z) - \f{Z + 1)] = for all functions / for which 
the expectations exist [2]. Similar characterizing operators have since been worked out for several other 
distributions [H [Til [1] [131 [T^ • Given such an operator A for ^{Z), one can consider the solution S to 
the equation {Af){x) = h{x) — E[h{Z)] for h G H. Taking expectations, absolute values, and suprema gives 

dn{^{X),^{Z)) = sup \E[h{X)] - E[h{Z)]\ - sup \E[{Ah){X)]\ . 
hen hen 

The intuition is that since E[{Af){Z)] = for / e J^, the distribution of X should be close to that of Z when 
E[{Af){X)] is close to zero. Remarkably, it is often easier to work with the right-hand side of the above 
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equation, and the tools for analyzing distances between distributions in this manner are collectively known 
as Stein's method. For more on this rich and fascinating subject, the authors recommend |17 1 ITS l l3l l4]. 

In this paper, we apply the above ideas to the (symmetric) Laplace distribution. For a S K, 6 > 0, a 
random variable W ~ Laplace(a, b) has cumulative distribution function 

he~b— w < a 



Fwiw;a,b) 

and density 



1— \e b ^ w > a 



fw{w;a,b) = ' " ' , u; G 




k is odd 
k is even 
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If ~ Laplace(0, b), then its moments are given by 

E[W''] = 

and its characteristic function is 

This distribution was introduced by P.S. Laplace in 1774, four years prior to his proposal of the "second law 
of errors," now known as the normal distribution. Though nowhere near as ubiquitous as its younger sibling, 
the Laplace distribution appears in numerous applications, including image and speech compression, options 
pricing, and modeling sizes of sand particles, diamonds, and beans. For more properties and applications of 
the Laplace distribution, the reader is referred to the text [9]. 

Our interest in the Laplace distribution lies primarily in the fact that if Xi,X2, ... is a sequence of random 
variables (satisfying certain assumptions) and Np ^ Geometric(p) is independent of the X^'s, then the sum 
pi X^i^i -^i converges weakly to the Laplace distribution as p — >■ [S]. Such geometric sums arise in a variety 
of settings [8] , and the general setup (distributional convergence of sums of random variables) is exactly the 
type of problem for which one expects Stein's method computations to yield useful results. Indeed, Erol Pekoz 
and Adrian RoUin have applied Stein's method arguments to generalize a theorem due to Renyi concerning 
the convergence of sums of a random number of positive random variables to the exponential distribution 
|12| . By an analogous line of reasoning, we are able to carry out a similar program for convergence of random 
sums of certain mean zero random variables to the Laplace distribution. 

We begin in section 2 by introducing a Stein equation which we show completely characterizes the mean 
zero Laplace distribution. In section 3, we use this characterization to construct a distributional transfor- 
mation which has the Laplace(0, b) law as a fixed point, show how this transformation can be used to bound 
the distance to the Laplace distribution, and establish sufficient conditions for a distribution to lie in the 
domain of this transformation. In section 4, we apply these tools to the study of random sums and deduce 
a Berry-Esseen type result for the convergence of certain geometric sums to the Laplace distribution. 

2. A Characterizing Equation 

The first order of business is to find a characterizing equation for the Laplace distribution and establish 
some properties of its solutions. We begin with the following lemma. 

Lemma 1. Suppose that W ^ Laplace{0,b), 6 > 0. Then if g is an absolutely continuous function such that 
g' is also absolutely continuous and E[g{W)] exists, we have 

E[giW)]~g{0)=b'E[g"iW)]. 



Proof. Applying Fubini's theorem twice shows that 



g"[x)e bdx= I g"{x)[ I ^dyldx 



^0 



g"{x)e dxdy 



(5'(2/) -.9'(0))e '^dy=- g'{y)e ' dy 



5'(0) 



e '' dy 



I 9'{v) i^j^ ^e"*^^) '^v - 5'(o) = hj^[ a'iy^'^'^yd^ - 5'(o) 

.9(0) 
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"^vii (3(^)-5(0))e '»dz-g'{0) = -^ I g{z)e i dz 



e^T^dz-g'iO) 



' g{z)e-~Uz^^-g'{0). 



(The fact that exists justifies the use of Fubini by reading the above backwards. 

Setting h{y) — g{~y), it follows from the previous calculation that 

g"{x)e^dx= / g" {-yje^T dy ^ / h" {y)e^T^ dy 



1 f°° 

— / h{z)e T^dz 
Jo 



1 



' g{^z)e-Uz~^+g'iO) 



1 /"^ 



m 



5'(0). 



Therefore, since the preceding equations show that 

1 



g {x)e<'dx + 



2h 



1 




26 













J^^g"{x)ebdx , g" {x)e ^ dx\ < oo, we see that 

"{x)e^^dx 
.9(0) . 



g(,)ef - ^ + .9'(0)) + (^1 g(z)e-f rfz - ^ - ^'(0)) 



5(0) 



-At^ I 9i^)e-^dz - 2^ ) = - iE[giW)] - g(0)) 
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□ 



Observe that if X is any random variable which has all of its moments and satisfies the equation in Lemma 
1, then, taking g{k) = x'' , we have E[X''] = b^k{k - l)E[X''-^] for all keN. Since E[X] = + b^E[0] = 
and £'[^2] = + 62£'[2] = 26^, this recursion implies that 

r fc, I 0, fc is odd 

^,,■.= E[x'] = \ I 

I o k!, A; is even 

for all fc G N, which is precisely the moment sequence for a Laplace(0, 6) random variable. Because (/i2fe)^ = 

6[(2fc)!]A < 6[(2fc)2'=]A = 26fc for all fc e N, we have J2T=i {jk:)'' - IbT.kLiI = so it follows 
from Carleman's criterion [S] that the Laplace(0, 6) distribution is the unique distribution having moments 
{fJ-k}kLi- Therefore, the equation in Lemma 1 completely characterizes the Laplace distribution amongst all 
distributions for which every moment exists. 
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Note also that since the density of a Laplace(0,6) random variable is given by fw{w) = (> , the 

density method [3J suggests the following Stein equation for the Laplace distribution: 

= g'{w) + {^'f^l giw) = g'{w) - \sgn{w)g{w), 
Jw[w) b 

and, indeed, one can verify that ii W ^ Laplace(0, b), then 

E[g'iW)] - lE[sgn{W)g{W)] 

for all absolutely continuous g for which these expectations exist. Thus if g' is such a function as well, setting 
G{w) = sgn(u;) (giw) — g(0)), we have 

EW'iW)] = lE[sgn{W)g'{W)] = ii?[G'(W-)] = ^E[sgn{W)GiW)] = 1 iE[giW)] - g(0)) , 

so the general form of the equation in Lemma 1 can be ascertained by iterating the density method. 

Now, in order to establish that the equation in Lemma 1 completely characterizes the Laplace(0, b) dis- 
tribution, we will show that if X is any random variable satisfying this equation, then 

dBL (^(X),Laplace(0,6)) = 

where dsL denotes the bounded Lipschitz distance given by 

dBL{^{X),^{Y))^ sup \E[h{X)] - E[h{Y)]\ , 
heUBL 

Ubl = {h:\\h\\^<l and \h{x) ~ h{y)\ <\x-y\ for aU x,y e M}. 

As d^L is a metric on the space of Borel probability measures on R |19j . this will establish the converse to 
Lemma 1. We begin by recording the following proposition relating dsL to the more familiar Kolmogorov 
distance 

dK {-^{X), ^{Y)) = sup \V{X <x}~ P{y <x}\. 



Proposition 1. If Z is an absolutely continuous random variable whose density, fz, is uniformly bounded 
by a constant C < oo, then for any random variable X , 

dK {^{X),J^{Z)) < ^l±^^dBL {J^{X),J^{Z)). 



Proof. Wc first note that the inequality holds trivially if (^(X), ^(Z)) = since dBL and dK are 

Z+; 
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metrics. Also, since dK{P, Q) < 1 for all probability measures P and Q, > 1. and dBL {-^{X), J^{Z)) > 



1 implies y/dBL {^{X),^{Z)) > 1, we have 



C + 2 



dK {^{X),^{Z)) < 1 < -^^dBL {J^{X),J^{Z)) 

whenever dBL > 1. As such, we may suppose that dBL {^{X),^{Z)) e (0, 1) 

Now, for a; e R, e > 0, write 



1, z < X 

hx{z) ^li^^,Az) ^ { ' ~ and = <( 1-^, ze{x,x + e\ 

\ Z > X 



1, Z < X 

- z e (a;,x 4 

0, z > X + e 



Then for all x S R, 

E[KiX) - KiZ)] = E[h,iX)] - E[h,^,{Z)] + E[h,^,{Z)] ~ E[h.,{Z)] 

< {E[h,AX)] - E[h,^,{Z)]) + (l ~ fz{z)dz 



Ce 

<\E[h,^,{X)]-E[h^^,{Z)]\ + —. 



Since dgh e (0,1), if we take e = ^(Ibl {^{X),^{Z)) e (0,1), then eK^e e Ubl and thus 

1 Ce 
E[h,{X) - K{Z)] < - \E[eh.,^,{X)] - E[eh,,,{Z)]\ + — 

1 Cf 

<-dBL mx),^{z)) + ^ 

C + 2 



WdBL (if(X),if(Z)). 
A similar argument using 

E[hjz) - ;i,(X)] = i?[/i,(^)] - E[K_,,,{Z)] + E[K_,JZ)] - E[K{X)] 

Ce 

< — + {E[h,_,,,{Z)] - E[h,_,,,{X)]) 



shows that 



C + 2 



\E[K{X)] - E[K{Z)]\ < -^^dBL (^(X),if(Z)) 



for all cc e M, and the proposition follows by taking suprema. □ 



Remark. When C > 1, we can take £ = y ^dBL in the above argument to obtain an improved 

bound of 



dK {^{X),^{Z)) < -^CdBL (if(X),if(Z)). 

To the best of the authors' knowledge, the above proposition is original, though the proof follows the same 
basic line of reasoning as the well-known bound on the Kolmogorov distance by the Wasserstein distance 
(see Proposition 1.2 in [TS]). It seems that the primary reason for using the Wasserstein metric, dw, is 
that it enables one to work with smoother functions while still implying convergence in the more natural 
Kolmogorov distance. Proposition 1 shows that dBL also upper-bounds dx while enjoying all of the resulting 
smoothness of dw and with additional boundedness properties to boot. Also, dBL is a fairly natural measure 
of distance since it metrizes weak convergence |19| . However, convergence in dw is always stronger than 
convergence in dBL and it is possible to have dBL — while dK {^(X), ^{Z)) > when 

the distribution of Z does not satisfy the assumptions of Proposition 1. Furthermore, the bounded Lipschitz 
metric does not scale as nicely as the Kolmogorov or Wasserstein distances when the associated random 
variables are multiplied by a positive constant. For the remainder of this paper, we will state our results 
in terms of dBL with the corresponding Kolmogorov bound being implicit therein, though one should note 
that, as with the Wasserstein bound on dj^, Kolmogorov bounds obtained in this fashion are not necessarily 
optimal and often give the root of the true rate. 

Now, in light of Lemma 1, we are led to consider the solution to the initial value problem 

g{x) - b^g"{x) = h{x) - Wh, g{0) = 

5 



where h G Hbl and Wh = E[h{W)], W ~ Laplace(0, 6). For the sake of compactness, we will write 

h{x) := h{x) - Wh. 

The general solution to the homogeneous equation g"{x) — b^^g{x) = is given by go{x) = de^ +C2e~^ , 



so, smce 



e t 
1 



= — I ^ 0, it follows from the variation of parameters method that a solution to 
the inhomogeneous equation g"{x) — b~'^g{x) = —b~'^h{x) is given by 



where 



9h{x) = vl{x)e^ +vl{x)e ^ 
viix) = ^ J e-ih(y)dy and vl{x) = eih{y)dy. 



To see that the initial condition is satisfied, we observe that 

1 r°° I i~ f°° 

gniO) = viiO) + vUo) = -J e-^h{y)dy = fw{y) {h{y) - Wh) dy 

/CO pOO 
h{y)fw{y)dy -Wh fw{y)dy = Wh-Wh = 0. 
-co J — oo 

Taking derivatives yields 

d'hix) = vl' {x)e^ + ^vl{x)e^ +vl'{x)e-^ - ^vl{x)e-^ 

^{x) + \vl{x)eT^ - ^h{x) - ^v^(a;)e-^ = ^ {vl{x)e^ - vl{x)e-^) 



2b 



and 



9hi^) = ^ {-h{x) + vl{x)et +vl{x)e ?) , 
9h{x) = -^h'{x) + 1 {vl{x)ei - vl{x)e-^) . 



1,2 y J ' fj3 

(The expression for 5^' may also be obtained by noting that gh satisfies gh{x) — b'^g'l{x) = h{x) and the 
expression for g'^^' then follows by differentiating g'l{x) = {gh{x) — h{x) + Wh).) 



Moreover, as 



< 1, 



\Wh\ 



26 7-, 



h{x)e ~dx 



< 



2b 



e >> dx = 1, 



and thus 



and 



h{x) < \h{x)\ + \Wh\ < 2. Consequently, 

1 X f°° y 

\vl{x)e^\<—e^ J 2e-*dt/ = l 



1 X r^" 

\vlix)e~^\< —e-^ J 2eidy = l, 
for all a: e R, so, since < 1 as well, we have 



Lemma 2. For h € Hbl, W Laplace{0,b), h{x) := h{x) — Wh, a bounded solution to the initial value 
problem 

g{x) - b''g"{x) = h{x), g{0) = 



is given by 

This solution satisfies \\gh\\^ < 2, ||g|J|^ < |, \\g'^\\^ < ^ and ||.9^"||^ < Consequently, E\gh{X)\, 

E \g'f^{X)\, and E \gi[{X)\ exist for all X, and g'f^ are absolutely continuous, and g'/^ is Lipschitz with 

b+2 
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Lipschitz constant ^p^. 



With the preceding resuhs in hand, we are in a position to estabhsh that the equation from Lemma 1 
does indeed characterize the Laplace distribution. 

Lemma 3. If X is a random variable such that 

E[g{X)] - g{0) = b'E[g"{X)] 

for every absolutely continuous function g such that g' is absolutely continuous and E[g(X)] exists, then 
X ^ Laplace(0, b). 

Proof. Let W ^ Laplace(0, b) and, for h e Hbl, let gh be as in Lemma 2. Because gh{0) — and gh, g'l^, 5^' 
are bounded, it follows from the above assumptions that 

E[hiX)] - E[hiW)] = E[g^iX) - b'gUX)] = 0. 

Taking suprema over h e Hbl shows that dsL iJ^{X), ^{W)) =0. □ 

Therefore, X ^ Laplace(0,6) if and only if E[g{X)] — 5(0) = b'^E[g"{X)] for every absolutely continuous 
function g such that g' is absolutely continuous and exists. 



3. Bounding the Error 

Our next task is to use this characterization to obtain bounds on the error terms resulting from approx- 
imation by the Laplace distribution. To this end, we introduce the following definition: For any random 
variable X with finite second moment, we say that the random variable X^ has the symmetric equilibrium 
distribution with respect to X if 

E[f{X)] - /(O) = ^E[XW{X^)] 

for all functions / such that / and /' are absolutely continuous and E |/(^)| ,E \ f'{X)\ < 00. We call the 
map X I— >■ X^ the symmetric equilibrium transformation. (The nomenclature is based on similarities with 
the equilibrium distribution from renewal theory which was used in a similar manner in [12j for a related 
problem involving the exponential distribution.) 

The utility of this definition is that if X is a random variable which is in the domain of the symmetric equi- 
librium transformation, then we can bound the distance between the law of X and the Laplace distribution 
with the following theorem. 

Theorem 1. If X is a random variable such that = 2b^ , and X^ exists, then 

dBwi^{X),Leip\ace(0,b)) < ^^-^E\X - X^\ . 

b ' ' 
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Proof. If X satisfies the assumptions of the theorem and has the symmetric equihbrium distribution for 
X, then for ah h E Hbl, taking gh as above, we see that 

\Wh - E[h{X)]\ = \E[g,{X) b'g'aX)]\ = - b'g'^X)]] 

< b'E \g'^{X^) g',:{X)\ < b' |K|| E\X^-X\ 

-'4^E\X-X-\. 
b ' ' 

□ 

In addition, we have the complementary result 
Proposition 2. IfY^ has the symmetric equilibrium distribution for Y where E[Y'^] = 26^, then 

E\Y-Y'^\<E\Y\ + -^E[\Y\% 
Proof. The inequality holds trivially if £^[|yp] — oo. Otherwise, we have 

e\y~y^\< (e\y\ + e\y^\) = e\y\ + -^E[\Y\^], 

where the equality follows upon taking f{y) = [yf^ in the definition of the transformation Y i— > Y^. □ 

At this point, we need to show that the domain of the symmetric equilibrium transformation is large 
enough for these results to be of use. In order to do so, we appeal to the following theorem due to Larry 
Goldstein and Gesine Reinert concerning generalizations of the size bias and zero bias transformations [7] . 

Theorem 2 (Goldstein and Reinert). Let X be a random variable, m € {0,1,2,...}, and P a measurable 
function with exactly m sign changes, positive on its rightmost interval, such that 

-^E[X^P{X)] = a5k,m, fc = 0, 1, m 
m! 

for some a > 0. Then there exists a unique distribution for a random variable X^^^ such that 

E[P{X)F{X)] = aE[F^"'Hx<-^^)] 

for all F e r"\X,P) = {F : F^™) exists and is measurable, E\P{X)F{X)\ < oo}. We say that X(^) has 
the X — P biased distribution. 

Remark. The proof of Theorem 2 shows that in the definition of J^™ {X, P) , it suffices to assume that 

is absolutely continuous and F*^™' exists on a set S with ¥{X e S*} = 1, and we will assume this in what 

follows. Thus, for example, the absolute value function lies in J-^{X, P) as long as E \XP{X)\ < oo. 

The basic idea of our existence argument is to iterate the mapping X X^-^^ for a suitable function P 
in much the same way as we showed that we could derive the second order Stein equation for the Laplace 
distribution by iterating the density method. The idea of composing distributional transformations appears 
in |13| as well, and their results also involve a second order characterizing equation. The precise statement 
of our result is 

Theorem 3. Let X be any random variable satisfying E[X] = 0, F{X < 0} = V{X > 0} = ^, and 
E[X'^] < oo. Then there exists a random variable X^ which satisfies E[f{X)] - /(O) = ^E[X'^]E[f" (X^)] 
for all functions f such that f, f are absolutely continuous and E \ f{X)\, E\f'{X)\ exist. 
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Proof. Taking P{x) = sgn{X) = < ' , it follows from the assumptions on X that 

I 1, a; > 

^E[X^P{X)] = £;[sgn(X)] = and ^E[Xhgn{X)] =E\X\:=a> 0, 

thus, by Theorem 2, there exists a distribution for a random variable X'-^^ with E[P{X)F{X)] = aE[F'{X'-^^)] 
{oj:allFeJ^^{X,P). 

Since ^ \x\ = sgn(a;) for x ^0, and F{X = 0} = by assumption, it follows from the definition of X^^^ 
that 

£[sgn(X(^))] = ^E[sgn(X) \X\] = ^E[X] = 0, 
and similarly, since ^ ^^sgn(a;)^ = \x\, 

£;[X(^)sgn(X(^))] = = ^E[sgn{X)^Xhgn{X)] = ^E[X^] := /3. 

As such. Theorem 2 asserts the existence of a random variable, which we will call X^, that has the X^^^ — P 
biased distribution. 

Therefore, if / is any function satisfying the assumptions in the statement of the theorem, setting g{x) = 
sgn{x) (fix) - /(O)) e T^X, P), we have 

E[f{X)] - /(O) = E[sgn{X)g{X)] = aE[g' {X^''^)] = ai?[sgn(X(^))/'(X(^))] 
= a/3^[/"(X^)] = ^EiXWi^"-)]- 

□ 



Theorem 3 can also bo derived directly from the Riesz representation theorem using the same general 
argument, but Theorem 2 was the original inspiration and it spares us some of the technical details involved 
in a direct proof. 

It is worth mentioning that the choice of P{x) = sgn(a;) for constructing a transformation suited to the 
Laplace(0, b) distribution has connections with the density method for constructing an appropriate Stein 
equation in that — — ■isgn(x). This connection is also evident in the zero bias transformation which 

is used in normal approximation. In general, if one is trying to construct a first order transformation suited 
to a random variable with density p{x), then a good initial guess for choosing P{x) in the definition of the 
X — P biased transformation is to take P{x) = — (or some constant multiple thereof). This makes 
sense because if E[F'{X) + ^^^^^ F{X)] = 0, then X is a fixed point of the transformation X X^^^ with 
E[—a^^^F{X)] = aE[F'{X^^^)]. The present work shows that it may sometimes be useful to iterate these 
heuristics to construct higher order Stein equations and distributional transformations. 

We conclude this section by mentioning some properties of the symmetric equilibrium distribution. Here 
X is taken to be a random variable in the domain of the symmetric equilibrium transformation wliicli has 
mean and variance 26^. To begin with, if X has moments of all orders, then by considering the function 
f{x) = a;'^+^, we see that the moments of are given by 
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of X by taking gt{x) ~ — t^^e**^ in the definition of X^. 

In addition, since the proof of Theorem 2 is constructive, one can determine the form of X^ without 
resorting to characteristic function arguments. Specifically, using the notation of Theorem 3, one can show 
that X'--^^ is of the form UiY where Ui and Y are independent with Ui ^ Uniform(0, 1) and Y having density 



Finally, since the characterizing equation involves second derivatives and a variance term, one expects some 
kind of relation between X^ and the zero bias distribution for X (defined by E[Xf{X)] = E[X'^]E[f' {X')] 
for all absolutely continuous / for which these expectations exist), in much the same way as the equilibrium 
distribution is related to the size bias distribution p^. Though we cannot express X^ directly in terms of 
X^, a similar argument shows that if [/ ~ Uniform(0, 1) is independent of all else, then 



for all functions / such that /, /' are absolutely continuous and the above expectations exist. We have not 
been able to make much use of the preceding results in our investigations of geometric sums, but they may 
come in handy for other applications involving the Laplace distribution. 



The p- geometric summation of a sequence of random variables Xi, X2, ... is defined as Sp = Xi + X2 + 
... + XjVp where Np is geometric with success probability p - that is, P{Np = n} = p(l — p)"^^, n E N - and is 
independent of all else. A result due to Renyi [14J states that if Xi,X2, ... are i.i.d., positive, nondegenerate 
random variables with = 1, then ^{pSp) — > Exponential 1) as p — > 0. In fact, just as the normal 

law is the only nondegenerate distribution with finite variance which is stable under ordinary summation 
(in the sense that if X,Xi,X2t.. are i.i.d. nondegenerate random variables with finite variance, then for 
every n G N, there exist a„ > 0, 5„ S R such that X =d an {Xi + ... + X„) + 6„), it can be shown that if 
X, Xi, X2, ... are i.i.d., positive, and nondegenerate with finite variance, then there exists Op e M such that 
ftp (Xi + ... + ^TVp) =d X for all p e (0, 1) if and only if X has an exponential distribution. Similarly, if 
we assume that Y,Yi,Y2, ... are i.i.d., symmetric, and nondegenerate with finite variance, then there exists 
ftp € M ftp (Yi + ... + i^TVp) —d Y for all p G (0, 1) if and only if Y has a Laplace distribution. Moreover, it 
must be the case that Op = p^. In addition, we have an analog of Renyi's Theorem [9]: 

Theorem 4. Suppose that Xi, X2, ... are i.i.d. symmetric and nondegenerate random variables with finite 
variance cr^, and let Np ^ Geometric{p) be independent of the X-s. // 

apy^ Xi -^d X as p ^ 0, 




fviv) — f^iy)' ^^^'^ -^L is of the form U2Z where U2 and Z are independent with U2 ^ Uniform(0, 1) 
and Z having density fz{z) = (j-) [ /x(-p) (^) = "ElX^f \^\fx(p){z) Ui- Consequently, we may write the 
density oi X^ as 





4. Random Sums 




i=l 
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then there exists 7 > such that Up — pij + o{pi) and X has the Laplace distribution with mean and 
variance (7^7^. 



A recent theorem due to Alexis Toda gives the following Lindeberg-type conditions for the existence of 
the distributional limit in Theorem 4. 



Theorem 5 (Toda). ief Xi, A2, ... be a sequence of independent (but not necessarily identically distributed) 
random variables such that E[Xi] = and Var(A'i) = af , and let Np ^ Geometric(p) independent of the 
X^s. Suppose that 

lim n^"(T^ — for some < a < 1 , 

n— f 00 



:— lim — af > exists, 

1=1 

and for all e > 0, 

00 

\un T{l-py-'pE[Xf; \X,\ > ep'i] = 0. 
Then as p 0, the sum p^ X^t^i -^i converges weakly to the Laplace distribution with mean and variance 



Remark. The original statement of Toda's Theorem is slightly more general, allowing for convergence to a 
possibly asymmetric Laplace distribution, but we will ignore that version for the moment as we state the 
result only for the sake of comparison. 

In 2011, Pekoz and Rollin were able to generalize Renyi's theorem by using a distributional transformation 
inspired by Stein's method considerations [12j. Specifically, for a nonnegative random variable X with E[X] < 
00, they say that X'^ has the equilibrium distribution with respect to X if E[f{X)] — /(O) = E[X]E[f'{X'^)] 
for all Lipschitz / and use this to bound the Wasserstein and Kolmogorov distances of a distribution to 
the Exponential 1) distribution. (The equilibrium distribution arises in renewal theory, but its utility in 
analyzing convergence to the exponential distribution comes from the fact that a Stein operator for the 
exponential distribution with mean one is given by {AEf){x) = f'{x) — f{x) + /(O), so X ^ Exponential 1) 
is a fixed point of the equilibrium transformation |15).) This transformation and the similarity between 
our Stein characterization of the Laplace and the above characterization of the exponential inspired our 
construction of the symmetric equilibrium transformation, and the fact that both distributions are stable 
under geometric summation led us to parallel their argument for bounding the distance between p-geometric 
sums of positive random variables and the exponential distribution in order to obtain corresponding results 
for the Laplace case. Our results are summarized in the following theorem and its corollaries. 



Theorem 6. Let N be any N-valued random variable with fi = E[N] < 00 and let Xi,X2, ... be a sequence 
of independent random variables, independent of N , with P{Xi < 0} = P{A,j > 0} = i, ^^[A'.i] = 0, and 

. 21 



E\Xf] ^ at < 00. Set = E 



(Ef=r 



X, 



= E 



and let M be any N-valued random variable, 



defined on the same space as N , satisfying 

P{M = to} 
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'-FIN > to}. 



Then 



f if (Ai"^ ^Xi), Laplace ^0, j 



NVM 



{Xm - Xl,) + sgn(7V - M) ^ Xi 

i={NAM) + l 



< — 1 [E\X_ 



-E 



\Xm\ 



'M 



+ su^aiE[\N -M\-^] . 

i>l 



Proof. We first note that 



m=l 



SO M is well-defined. 



Now. taking V = fjr^ Sili -^ii '^^ claim that T^^ = fjri {^i'Li ^ + ^iv/j ^^^^ symmetric equilib- 
rium distribution with respect to V. To see this, let / be any function satisfying the assumptions of Theorem 
3. Then, using the notation 

m 

letting denote the distribution of 



m=l 



1 



'm-1 



:=E^^ 



and observing that, by independence, 

E[h"{X^)\Srr._,=s]=E[h"{X^)] 

for all suitable functions h, we see that 

m— 1 



E 



r{t,-ij2Xi+n-ix^) 



4- - h{0)]) = -^EihiXrr,) - h{0)\Sm-l = s], 



j E [fiKiX^) \Sm-l = S] dVm{s) 

^ [e [hsiXm) - K{0) \Sm-l = S] dVm{s) 

^ I E f{n-h + n-iXm)- f{fi~is)\Sm-i = s dPm{s) 

m m—1 

/(m-^E^')-/('""^E^*) 



2/x 

(7; 



i=l 
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for all m G N, hence 



oo m — 1 

m=l i=l 

= 51 ^^{M = m}E [g{m) - g{m - 1)] 

^ -. 

oo 

^ ¥{N > to} (^(to) - g{m - 1)) 



2 



■E 



_m=l 



= (£;[5(Ar)] - 5(0)) = (i5[/(y)] - /(O)) . 

Consequently, setting 2B^ = E[V^] = it follows from Theorem 1 that 
dsL(=Sf(F),Laplace(0,B)) < (l + E \V - V"-] 



/X ^ + — \ E 



NVM 



{Xm - X^) + sgn(7V - M) ^ X, 

i=(ArAM)+l 



AfVM 
i=(JVAM)+l 



By Proposition 2, we have 

oo 
m=l 

°° / 1 r 

< ^P{M = m}f£|X„| + — 

m=l ^ 



= ^|XM| + -i; 



M 



and, since the X[s are independent, it follows from the Cauchy-Schwarz inequality 



that 



E 



NVM 


CX) CSO 








j+k 






|7V- 


M\ 


= k}E 


E 


CiVAM)+l 


j=i k=i 












OO oo 

<EE^{^^^^=j' 


\N~ 


M\ 


= k}E 


E 
















oo oo 

<EE^{^/^^=j' 


\N- 


M\ 


= k}ki 


SUpCTi 
1 



j=l fe=l 



oo 

= supcTi V A:^P{|7V - M| = A:} = supcr»i;[|7V - M\^]. 



□ 



Specializing to the i.i.d. case, so that = fxaf , we have the immediate corollary 
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Corollary 1. Let N be any N-valued random variable with /i = E[N] < od and let Xi,X2, ... be a sequence 
of i.i.d 

and E 



of i.i.d. random variables, independent of N , with ¥{Xi < 0} = ¥{Xi > Q} ^ \, E[Xi] ^ 0, E[Xl] = 2h'^ , 



The 



— p < oo. Let M be any N-valued random variable, defined on the same space as N , satisfying 
V{M = m}^ fi-^¥{N > m}. 



dBL (^(m^^^X,), Laplace (0,6)] < ttl (^E \Xi\ + + bV2E[\N ~ M\^]^ 



1=1 



When N ^ Geometric(p) and the X-s satisfy the assumptions of Theorem 6 along with the additional 



constraint E[Xf] = 2b^ for alH > 1, we have M = ^ ^i^d 

2 ^ 

^¥{N > m} = pV{N > m} ^ p^ (I - = p(l - = P{iV = m}, 

i—m 

SO we can take M = in Theorem 6 to obtain 



dBL l^^b^^X,), Laplace (0,6)^ < P^'^^E\Xn - X^\ 



<pi^-±^fE\XM + lE 



-0- \^Nf 



'N 

Consequently, using Cauchy-Schwarz to bound E \Xn\, we have the following Berry-Esseen type theorem. 

Theorem 7. Let Xi,X2, ... be a sequence of independent random variables with ¥{Xi < 0} = P{Xi > 0} = 
^, E[Xi] — 0, E[Xf] = 2b^ , and supj>]^ E = p < oo, and let N ^ Geometric{p) be independent of the 

X'iS. Then 

/ N \ , „ 

P 



dBL (^{p^ ^ Xi), Laplace (0, 6) j < p^ ^ (bV2 



for all p e (0, 1). 



Theorem 7 gives sufficient conditions for weak convergence in the setting of Theorem 4. Though it requires 
that the X[s have uniformly bounded third absolute moments, the condition of symmetry is weakened to 
f'{Xi < 0} = f{Xi > 0} = ^ and the identical distribution assumption is reduced to the requirement that 
the X[s have common variance. This result is not as general as Theorem 5, but it does provide bounds on 
the error terms, which is important for practical considerations. 

5. Concluding Remarks 

In addition to providing a Stein characterization for the Laplace distribution along with bounds on the 
error terms from Laplace approximation, and using these to obtain criteria for the convergence of geometric 
sums to the Laplace distribution with an estimate of the convergence rate, the methodology employed in this 
paper is of interest in its own right. Firstly, our characterizing equation involves second derivatives and can be 
obtained by successive applications of the density method. The density method is perhaps the most common 
means of finding a characterizing equation [3] , and it is intriguing that the procedure can be iterated to obtain 
useful distributional characterizations involving higher derivatives. In the case of the Laplace(0, h) distribu- 
tion, the density method suggests the Stein operator {ALg){x) = g'{x) — j^sgx\{x)g{x), but we were unable to 
get much mileage out of this characterization while the second-order operator {ALg){x) — g{x)-~g{0)~b^g"{x) 



turned out to be quite effective. Perfiaps this will also prove to be the case for other distributions. Secondly, 
our work made use of Goldstein and Reinert's X — P bias transformations j7], which is a very neat theory 
that has not yet found many applications. It seems likely that as Stein's method techniques are applied to 
more and more classes of distributions, their results will be useful in constructing corresponding transfor- 
mations to aid in the calculations. Note also that in this context the present work demonstrates the utility 
of composing distributional transformations to obtain new ones. Thirdly, we hope that Proposition 1 serves 
to popularize the use of the bounded Lipschitz metric in the Stein's method community. We were unable to 
get useful results in the present setting when working directly with the Kolmogorov or Wasserstein distances 
because of problems with continuity and boundedness issues, but the computations were very nice and easy 
once we turned to dsL- Moreover, even when one can obtain suitable solutions to the Stein equation using 
dw, they are often much simpler if one assumes that the h are bounded as well. Finally, the fact that we 
were able to parallel the argument from [12J to find a complementary random variable which could be used 
in Stein's method calculations suggests that this technique may have broader applicability. Just as one can 
typically find the desired transformation of a sum of a fixed number of random variables by exchanging ap- 
propriately chosen summands with their transformations, it appears that one may often be able to deal with 
sums of random numbers of random variables by introducing a different random variable as the upper index 
of summation and then exchanging the last summand with its corresponding transformation. It would be 
interesting to see if this basic strategy works with more general Linnik laws, Mittag-Leffler distributions, and 
other geo-stable families as well (see [10]). One also hopes that these ideas can be used to find non-geometric 
random indices which give rise to similar limit theorems. 
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